Geometric Methods for Optical Character Recognition

Authors

  • George N. Sazaklis
  • Joseph S. B. Mitchell
Abstract

Geometric Methods for Optical Character Recognition, by George N. Sazaklis. Doctor of Philosophy in Computer Science, State University of New York at Stony Brook, 1997. Advisor: Joseph S. B. Mitchell.

Optical Character Recognition (OCR) is an important problem of both theoretical and practical interest. In this dissertation, we present solutions to three problems within the area of OCR.

A difficulty encountered by many OCR systems is confusion between similar shapes when flexible matching is employed as the primary recognition mechanism. Our solution, constrained matching as a second-stage classification technique, discriminates between similar shapes using their geometric attributes, enabling the system to reach a final decision on the character identity.

Another important problem in OCR is fast and reliable fixed-font recognition. We present a hierarchical classification technique that utilizes the concept of geometric probe trees from [4]. At each node of the probe tree, a geometric probe collects information from the shape at hand and makes a partial decision about its identity, eliminating certain candidates from further consideration. The probe tree can be constructed off-line in a preprocessing step and provides high-speed recognition for a fixed font.

We also present an extension of geometric probes, the prefix probing technique, which solves the important practical problem of touching characters for a fixed font. Prefix probing is based on a forest of probe trees that succeeds in identifying each member of a sequence of touching characters independently of its neighbors. This avoids the excessive demands of a straightforward solution.

To substantiate our methods, we have developed several tools and run experiments on scanned documents; we also present our experimental results.
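The probe-tree idea described in the abstract can be illustrated with a small sketch. This is not the dissertation's implementation: the 3x3 glyph bitmaps in `FONT`, the choice of a single-pixel test as the "probe", and the greedy balanced-split rule in `build_tree` are all assumptions made for illustration, and the probes defined in [4] may be quite different. The sketch only shows the general pattern of building a decision tree off-line and then classifying a shape by following probe outcomes.

```python
from itertools import product

# Hypothetical 3x3 "fixed font": each glyph is a tuple of rows, '#' marks ink.
FONT = {
    "I": (".#.", ".#.", ".#."),
    "L": ("#..", "#..", "###"),
    "T": ("###", ".#.", ".#."),
    "O": ("###", "#.#", "###"),
}

def probe(glyph, point):
    """Assumed probe: report whether the pixel at (row, col) is ink."""
    r, c = point
    return glyph[r][c] == "#"

def build_tree(candidates, font):
    """Off-line step: greedily pick the probe that splits the candidates most evenly."""
    if len(candidates) <= 1:
        return candidates[0] if candidates else None  # leaf: a glyph name
    rows, cols = len(font[candidates[0]]), len(font[candidates[0]][0])
    best = None
    for point in product(range(rows), range(cols)):
        hit = [n for n in candidates if probe(font[n], point)]
        miss = [n for n in candidates if not probe(font[n], point)]
        if hit and miss:  # the probe must eliminate at least one candidate
            score = abs(len(hit) - len(miss))
            if best is None or score < best[0]:
                best = (score, point, hit, miss)
    if best is None:
        raise ValueError(f"glyphs are indistinguishable: {candidates}")
    _, point, hit, miss = best
    # Internal node: (probe point, subtree if ink, subtree if no ink).
    return (point, build_tree(hit, font), build_tree(miss, font))

def classify(tree, glyph):
    """On-line step: follow probe outcomes from the root down to a leaf."""
    while isinstance(tree, tuple):
        point, hit, miss = tree
        tree = hit if probe(glyph, point) else miss
    return tree

tree = build_tree(sorted(FONT), FONT)
print({name: classify(tree, glyph) for name, glyph in FONT.items()})
```

With a balanced tree, each glyph is identified after roughly log2 of the font size probes instead of a comparison against every template. Note that this toy classifier always returns some glyph from the font, even for a shape that matches none of them; a practical system would need a verification step.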


Similar resources

Recognition of Characters in Scenes by Using Geometric Contexts of Local Features

If you could get useful information from a character image in a scene captured with your camera, all you would have to do is release the shutter, which saves time. To realize such a system, we need a character recognition system that can correctly find the character area in the scene image and recognize it. In this paper, we propose two methods to achieve it, which consi...


Image Preprocessing For Geometric Feature Extraction in OCR Systems

Optical character recognition (OCR) is one of the most successful applications of pattern recognition and image processing. Character geometry is one of the most useful features for identifying characters in images. The geometric feature extraction techniques proposed in the literature are complex and require extensive implementation effort. In this paper, we propose a preprocessing technique whi...


Automated Labeling from Biomedical Journals published in Foreign Languages

An automated labeling (AL) module is developed to produce bibliographic records such as English title, vernacular title, author, affiliation, and English abstract from biomedical articles published in foreign language journals. Optical character recognition (OCR) output from scanned biomedical journals is used in this labeling process. Since frequently occurring words in a zone are important fe...


Transcript mapping for handwritten Chinese documents by integrating character recognition model and geometric context

Creating document image datasets with ground-truths of regions, text lines and characters is a prerequisite for document analysis research. However, ground-truthing large datasets is not only laborious and time consuming but also prone to errors due to the difficulty of character segmentation and the large variability of character shape, size and position. This paper describes an effective reco...


Geometric Probing and Testing - A Survey

Geometric probing is the area of computational geometry that studies how to identify, verify, or determine some property of an unknown geometric object using a measuring device known as a probe. It has applications in the areas of robotics, automated manufacturing, computer vision, optical character recognition and tomography. Geometric testing is the subarea of geometric probing that studies t...



Publication date: 1997